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FIELD OF THE INVENTION 

The present invention relates to a video communication system, which is able to 
5 receive a digital video signal, according to the preamble of claim 1. The invention further 
relates to a method for processing a digital video signal according to the preamble of claim 
8. 

Such a system may be used, for example, for 3D video applications within MPEG 
standards. 

10 

BACKGROUND OF THE INVENTION 

A video communication system typically comprises a transmitter with an encoder 
and a receiver with a decoder. Such a system receives an input digital video signal, encodes 
said signal via the encoder, transmits the encoded signal to the receiver, then decodes the 

15 transmitted signal via the decoder resulting in an output digital video signal, which is the 
reconstructed signal of the input digital video signal. The receiver then displays said output 
digital video signal. A digital video signal comprises some sets of objects, which are 
characterized by components such as shapes, textures, motion information, disparity map 
(in case of 3D video signal) etc. 

20 When an object Is encoded, the components of said object are encoded. The 

encoding and decoding processes of each video component can more or less depend on 
each other. Coding performance rely on this inter-dependence. 

Let us give an example with 3D video objects encoded using MPEG-4 standard. In 
this document referred under the MPEG-4 document number w3056 at ISO and entitles 

25 "Information Technology - Coding of audio-visual objects - Part 2: Visual, ISO/IEC JTC 1/SC 
29 /WG 11, Maui, December 1999", when one encodes a stereo video sequence comprising 
two views, one view - say the left one - and the corresponding disparity maps (or depth 
maps) are encoded. The right view is reconstructed by projecting the left view using the 
disparity map. Parts of the right view that were not in the left one (occlusion parts) cannot 

30 be reconstructed and holes remain; In order to reconstruct properly the right view, one 
encodes the occlusion parts retrieved from the original right view as an MPEG-4 video 
object, whose shape corresponds to the holes. If the left view is encoded at a bit rate of 3.5 
Mbit/s, the enhancement layer defined to handle occlusion parts should not be larger than 
10% of this bit-rate. Here it takes 340 kbit/s. Occlusion objects are defined by shape and 

35 texture. Using the standard way of coding objects, their bit-rate costs are: 
Shape: 93 kbit/s i.e. 27% of the bit-rate allocated to occlusion 
Texture: 128 kbit/s i.e. 37% of the bit-rate. 



The occlusion shapes can be determined from the disparity map. Still, components encoding 
being interdependent, it is not possible to efficiently encode the texture of the occlusion 
parts, without encoding the shapes. Therefore, the bit rate cost of the video sequence Is not 
optimal. 

SUMMARY OF THE INVENTION 

AeeordinglyritHs-an-ebjeetHDR^ 

as defined in the preamble of claim 1 and a method as defined in the preamble of claim 8, 
which lower the bit rate needed to encode objects. 

To this end, according to a first object of the invention, there Is provided a video 
communication system as claimed in claiml. 

In addition, according to a second object of the invention, there is provided a 
method as claimed in claim 8. 

As we will see in detail further on, by giving the possibility to not encode a first 
component and therefore not transmit it while encoding the other components as if it was 
the case, the encoding efficiency will be improved, as less bits will be necessary to encode 
the whole object. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Additional objects, features and advantages of the invention will become apparent 
upon reading the following detailed description and upon reference to the accompanying 
drawings in which: 

- Fig. 1 illustrates a video communication system comprising an encoder and a decoder 
according to the invention, 

- Fig, 2 is schematic diagram of the method of encoding used by the encoder of the video 
communication according to the invention, and 

- Fig. 3 is schematic diagram of the method of decoding used by the decoder of the video 
communication according to the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

In the following description, well-known functions or constructions by the person 
skilled in the art are not described in detail since they would obscure the invention in 
unnecessary detail. 

The present invention relates to a video communication system for processing a 
digital video signal. 
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Such a system, depicted in Fig. 1, may be used for video applications in MPEG2 or 
MPEG4, wherein said video communication system comprises a transmitter TRANS, a 
transmission medium CH and a receiver RECEIV. Said transmitter TRANS and said receiver 
RECEIV comprise an encoder ENC and a decoder DEC respectively. 

5 

In order to transmit efficiently some video signals through the transmission medium 
CH, the transmitted bits of the video signals being known as bit-stream BIT_STR, said 
encoder ENC applies an encoding on a video signal, then the encoded video signal is sent to 
a decoder DEC, which decodes said signal. Finally the receiver RECEIV displays said video 
10 signal 

A video signal comprises some sets of objects OBJ with different components COMP 
such as shape, texture, motion vectors, disparity map, colors etc. 

When the encoder ENC encodes an object OBJ, It encodes effectively all the 
components COMP of said object OBJ. A lot of encoding of components is dependent from 
15 other components. For example, in INTER mode scheme, well known by the person skilled in 
the art, texture information can be used only if we have motion information. When it comes 
to video object coding, in MPEG-4 using block based principle, texture blocks positions are 
determined by shape, and knowing this shape enables to improve coding efficiency by using 
spatial redundancy between co-located texture blocks. 

20 

The encoder ENC comprises an availability information FLAG for determining if a 
component COMP of an object is tp be encoded, or not, encoding means for encoding said 
component COMP if said availability information FLAG is enabled, and transmission means 
for transmitting said component COMP of said object OBJ if the availability Information FLAG 
25 Is enabled. 

The decoder DEC comprises decoding means for decoding a component COMP if the 
availability information FLAG is enabled and retrieving means for retrieving said a 
component COMP if the availability Information FLAG is disabled. 

30 The encoding of an object OBJ is done by the encoder ENC as following as 

illustrated in Fig. 2. 

For each component COMP of the object OBJ, the encoder ENC decides If said 
component COMP will be included In the bit-stream BIT_STR or not. Decision depends on 
the type of video application: for example, in a stereo system where one encodes the 
35 occlusion part, one knows that there Is no need to encode and transmit the shape of these 
occlusions. 



Availability information FLAG is associated to each component COMP of the object 
OBJ. If a component COMP is to be included In the bit-stream, i.e. encoded and transmitted, 
the availability information FLAG is enabled. Preferentially, this availability information FLAG 
is placed at video object level (VO in MPEG-4). 

In a first non-limitative embodiment, this availability information FLAG has two 
values: 1, when enabled and 0 when disabled. 

In^-seOTn(hron-limlta8ve-embedime 

extended syntax: 1, when enabled and 0 plus a description codeword when disabled. The 
description codeword tells how to retrieve the missing component COMP (wait for extra 
input, like a pre-computed shape, wait for extra input and n* component Information, etc.). 
Afterwards, the decoder DEC will use this description codeword to retrieve a component 
COMP. It is supposed that at the decoder side, the way to retrieve a missing component 
COMP is well known and retrieval algorithms are available. 

Then, the encoder ENC encodes all the components COMP (step ENC_COMP in Rg. 
2) that will be included in the bit-stream BITJSTR and for the components COMP_D, which 
need the missing components COMP_M, it encodes them as if all the components COMP of 
the object OBJ and especially the missing components COMP_M have been used, encoded 
and transmitted as well. Indeed, some of the coding components may require the use of 
other components as described before. 

Note that the availability information FLAG and potentially description codeword 
(depending on the kind of availability information is used in the application) are defined for 
every of the object components COMP. The encoded video object OBJ corresponds then to 
all these availability information FLAG, along with the Included encoded components COMP. 
Hence, the bit-stream BIT_STR that is transmitted to the decoder comprises the availability 
information FLAG of the missing components, and for every other components, their 
corresponding availability information FLAG and the encoded component COMP Itself. 

Note that the following hypothesis has been made. We suppose we have an 
encoding algorithm where video objects OBJ fall into several components and that they 
correspond to different, separable parts of the bit-stream BIT_STR. It means that it is 
possible to replace one component without preventing the decoding process of another 
component. For example, in MPEG-4, if we encode a video sequence with texture and depth 
information, if we replace or modify the encoded texture Information, it still possible to 
decode depth. Furthermore, if we change the motion vector components, it is possible to 
decode the texture - even though we have a sequence that is visually different from the 
original, but the encoded texture information Is still correct. 
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Said encoded bit-stream BITJSTR is then sent to the decoder DEC of the receiver 
RECEIV, via the transmission medium CH. 

5 The decoding of the encoded bit-stream BIT_STR is done by the decoder DEC as 

following as illustrated in Fig. 3. 

When receiving the encoded bit-stream BIT_STR, the decoder DEC first checks the 
availability information FLAG in said bit-stream BIT_STR. If availability information FLAG is 
set to 1, it waits for the corresponding components COMP in the bit-stream BITJSTR, which 

10 follow their associated availability information FLAG. Then, it decodes them. It decodes first 
the components COMP that do not need any other encoded components or missing 
components COMP_M (step DECCOMP on the Fig. 3), then the components COMP_D that 
are dependent from the other encoded components and from the missing components 
COMPJvi. For these later dependent components COMP__D, the decoder DEC retrieves the 

15 missing components COMP_M (step RETR_COMP_Ml on the Fig. 3) and then decodes those 
dependent components COMPJD (step DEC_COMP_D on the Fig. 3). The dependence 
relation between components is application dependent: in a stereo video decoder, we could 
say that when decoding the occlusion shape, one uses the decoded disparity map - this 
could be done by introducing a description codeword meaning "Retrieved from Disparity". To 

20 take a more general case, In MPEG coding scheme, texture information is retrieved from 
motion vector and a previous frame. 

Once all of the encoded components COMP are decoded, the decoder DEC retrieves 
the last missing components COMP_M (step RETR_COMP_M2 on the Fig. 3). Note that this 
last step can be also done at any other time, for example, at the beginning, 

25 The bit-stream BIT.STR is as if none of the components had been removed. The 

decoding process, well known by the person skilled in the art, is possible, as if every 
component had been regularly encoded and transmitted. 

Thus, one advantage of the present invention is to simply decrease the bit rate used 
30 to encode an object by only using availability information FLAG, and to simply informing the 
decoder that it needs to retrieve itself some components. 

It is to be understood that the present invention Is not limited to the aforementioned 
embodiments and variations and modifications may be made without departing from the 
35 spirit and scope of the invention as defined in the appended claims. In the respect, the 
following closing remarks are made. 
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It Is to be understood that the present invention is not limited to the aforementioned 
video application. It can be used within any application using a system for processing a 
signal that is decomposed into several components, the encoding of which results in 
separated bit-stream parts. In particular, the invention applies to video compression 
5 algorithms of the other MPEG standards family (MPEG-1, MPEG-2) and to the ITU H26X 
family (H261, H263 and extensions, H26I being the latest today, reference number Q15-K- 
59 y — 

It is to be understood that the method according to the present invention is not 
10 limited to the aforementioned Implementation. 

There are numerous ways of implementing functions of the method according to the 
invention by means of items of hardware or software, or both, provided that a single item of 
hardware or software can carries out several functions. It does not exclude that an assembly 
of items of hardware or software or both carry out a function, thus forming a single function 
15 without modifying the method for processing the video signal in accordance with the 
invention. 

Said hardware or software items can be implemented in several manners, such as by 
means of wired electronic circuits or by means of an integrated circuit that Is suitable 
programmed respectively. The integrated circuit can be contained in a computer or in a 

20 video system communication. In the second case, the video system communication 
comprises encoding means for encoding a component of an object if the availability 
information is enabled, decoding means for decoding said component of said object if the 
availability information is enabled, and, retrieving means for retrieving said component of 
said object if the availability information is disabled, as described previously, said means 

25 being hardware or software Items as above stated. 

The integrated circuit comprises a set of instructions. Thus, said set of instructions 
contained, for example, In a computer programming memory or in a video system 
communication may cause the computer or the video system communication to carry out the 
different steps of the encoding method. 

30 The set of instructions may be loaded into the programming memory by reading a 

data carrier such as, for example, a disk. A service provider can also make the set of 
Instructions available via a communication network such as, for example, the Internet. 

Any reference sign in the following claims should not be construed as limiting the 
35 daim. It will be obvious that the use of the verb "to comprise" and Its conjugations do not 
exclude the presence of any other steps or elements besides those defined in any claim. The 




article "a" or "an" preceding an element or step does not exclude the presence of a plurality 
of such elements or steps. 
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CLAIMS 

A video communication system (SYS), which Is able to receive a digital video signal, said 
digital video signal comprising some set of objects (OBJ) with components (COMP), 
comprising an encoder (ENC) for encoding said video signal, a transmission channel 
(CH) for transmitting the encoded video signal and a decoder (DEC) for decoding said 
encoded video signal, characterized In that it comprises: 

- An availability information (FLAG) for determining if a component (COMP) of said 
object (OBJ) is to be encoded at the encoder (ENC) side and transmitted via the 
transmission channel (CH), 

- Encoding means atthe encoder side (ENC) for encoding a component (COMP) of 
said object (OBJ) if the availability information (FLAG) is enabled, 

- Decoding means at the decoder (DEC) side for decoding said component (COMP) of 
said object (OBJ) if the availability information (FLAG) is enabled, and, 

- Retrieving means at the decoder (DEC) side for retrieving said component (COMP) 
of said object (OBJ) if the availability information (FLAG) is disabled. 

A video communication system (SYS) as claimed in daim 1, characterized in that the 
encoding means are adapted to encode a component (COMP) as if all the components 
(COMP) of said object (OBJ) have been encoded and transmitted as well. 

A video communication system (SYS) as claimed in claim 1, characterized in that an 
encoded component is dependent (COMP_D) from another component (COMP), said 
another component being an encoded component or a missing component (COMP_D). 

An encoder (ENC) for encoding a digital video signal, said digital video signal comprising 
some sets of objects (OBJ) with components (COMP), characterized in that It comprises: 

- An availability information (FLAG) for determining if a component of said object 
(OBJ) is to be encoded and transmitted via a transmission channel (CH), and 

- Encoding means for encoding a component (COMP) of said object (OBJ) if the 
availability Information (FLAG) is enabled. 

An encoder (ENC) as claimed in claim 4, characterized In that the encoding means are 
adapted to encode a component (COMP) as if all the components (COMP) of said object 
(OBJ) have been encoded and transmitted as well. 
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6. A decoder (DEC) for decoding a digital video signal, said digital video signal comprising 
some sets of objects (OBJ) with components (COMP), characterized in that it comprises: 

- Decoding means for decoding a component (COMP) of said object (OBJ) if an 
availability information (FLAG) is enabled, and 

5 - Retrieving means for retrieving a component (COMP_M) of said object (OBJ) if an 

availability Information (FLAG) is disabled. 

7. A decoder (DEC) as claimed in claim 6, characterized in that the decoding means are 
adapted to decode first a component (COMP) that is not dependent from another 

10 component (COMP), then to decode a component (COMP_D) that Is dependent from 

another component, and if dependent from a missing component (COMP_M), to decode 
said dependent component (COMP_D) after the retrieving of said missing component 
(COMP_M). 

15 8. A method for processing a digital video signal, said digital video signal comprising some 
set of objects (OBJ) with components (COMP), characterized in that it comprises the 
steps of: 

- By a virtue of an availability information (FLAG), determining if a component (COMP) 
of said object (OBJ) is to be encoded and transmitted via a transmission channel 

20 (CH), 

- Encoding said component (COMP) of said object (OBJ) if the availability information 
(FLAG) is enabled and transmitting it, 

- Decoding said component (COMP) of said object (OBJ) if the availability information 
(FLAG) is enabled, and 

25 - Retrieving said component (COMP_M) of said object (OBJ) if the availability 

information (FLAG) is disabled. 

9. A method for processing a digital video signal as claimed In claim 8, characterized in that 
the encoding of a component (COMP) is performed as if all the components (COMP) of 

30 said object (OBJ) have been encoded and transmitted as well. 

10. A computer program product for a computer, comprising a set of instructions, which, 
when loaded into said computer, causes the computer to carry out the method in claims 
8 to 9. 



35 
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Method and system for processing a digital video signal 
ABSTRACT 

The present invention relates to a video communication system (SYS), which is able to 
receive a digital video signal, said digital video signal comprising some set of objects (OBJ) 
with components (COMP). The invention is characterized in that it comprises: 

- An availability information (FLAG) for determining If a component (COMP) of said 
object (OBJ) is to be encoded and transmitted via the transmission channel (CH), 

- Encoding means for encoding a component (COMP) of said object (OBJ) if the 
availability Information (FLAG) Is enabled, 

- Decoding means for decoding said component (COMP) of said object (OBJ) if the 
availability information (FLAG) is enabled, and, 

- Retrieving means for retrieving said component (COMP) of said object (OBJ) If the 
availability information (FLAG) is disabled. 



Use: MPEG video communication system 

Reference: Fig. 2 
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